中文ocr
2022年3月10日 — 中文ocr-Benchmarking Chinese Text Recognition: Datasets, Baselines, andan Empirical Study 原创 ... A large chinese text dataset in the wild.
CTW Dataset
In this paper, we introduce a very large Chinese text dataset in the wild. While optical character recognition (OCR) in document images is well studied and ...
esun-aitraditional-chinese-text-recogn
To the best of our knowledge, public datasets for Traditional Chinese text recognition are lacking. We generated over 20 million synthetic data and collected ...
FudanVIbenchmarking-chinese-text
This repository contains datasets and baselines for benchmarking Chinese text recognition. Please see the corresponding paper for more details regarding the ...
priyank
Chinese. Multilinguality: monolingual. Size Categories: 100K<n<1M. Tags: ocr text-recognition chinese · Dataset card Files Files and versions Community. Dataset ...
Chinese Text in the Wild Dataset
Chinese Text in the Wild is a dataset of Chinese text with about 1 million Chinese characters from 3850 unique ones annotated by experts in over 30000 ...
SCUT-HCCDoc
由 H Zhang 著作 · 2020 · 被引用 25 次 — In this paper, we introduce a large-scale dataset, called SCUT-HCCDoc, to address challenging detection and recognition problems of handwritten Chinese text ...